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^ Optimi za ti on : Performance o ptimi z ation of singl e-phase level-se nsitiv e c irc u it s usi n g 
time borrowing and non-zero clock skew 
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issues in the specification and synthesis of digital systems 
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This paper describes a linear programming (LP) formulation for performance optimization of 
large-scale, synchronous circuits with level-sensitive latches. The proposed formulation 
permits circuits to operate at a higher clock frequency— that is, with a lower clock period— 
by the application of both non-zero clock skew scheduling [7] and time borrowing [9], This 
LP formulation is computationally efficient and demonstrates significant circuit performance 
improvement. Unlike the approach document ... 
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High speed synchronous digital systems require large switching currents to facilitate rapid 
signal transitions. These large currents create voltage drops on the power distribution 
network and necessitate expensive chip packaging with a large number of supply pins. In 
this paper we propose a novel technique to reduce the dynamic transient current drawn 
from the supply pins. Our approach is based on sub-dividing the synchronous clocking into 
multiple sub-clocks with relative skew. This spreads the ... 
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For many years, CMOS process scaling has allowed a steady increase in the operating 
frequency and integration density of integrated circuits. Only recently, however, have we 
reached a point where it takes several clock cycles for global signals to traverse a complex 
digital system such as a modern microprocessor. Thus, interconnect latency must be taken 
into account in current and future design tools at the architectural as well as synthesis level. 
To this purpose, this work proposes a new laten ... 
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term s 

Low power has emerged as a principal theme in today's electronics industry. The need for 
low power has caused a major paradigm shift in which power dissipation is as important as 
performance and area. This article presents an in-depth survey of CAD methodologies and 
techniques for designing low power digital CMOS circuits and systems and describes the 
many issues facing designers at architectural, logical, and physical levels of design 
abstraction. It reviews some of the techniques and tool ... 

Keywords: CMOS circuits, adiabatic circuits, computer-aided design of VLSI, dynamic 
power dissipation, energy-delay product, gated clocks, layout, low power layout, low power 
synthesis, lower-power design, power analysis and estimation, power management, power 
minimization and management, probabilistic analysis, silicon-on-insulator technology, 
statistical sampling, switched capacitance, switching activity, symbolic simulation, synthesis, 
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The quadratic relationship between voltage and energy has nnade dynamic voltage scaling 
(DVS) one of the most powerful techniques to reduce system power demands. Recently, 
techniques such as Razor DVS, voltage overscaling, and Intelligent Energy Management 
have emerged as approaches to further reduce voltage by eliminating costly voltage margins 
inserted into traditional designs to ensure always-correct operation. The degree to which a 
global voltage controller can shave voltage margin ... 

Keywords: dynamic retiming with global DVS, local DVS, razor 
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In this paper, we investigate the problem of buffer sizing for clock power minimization 
subject to general skew constraints. A novel approach based on sequential linear 
programming is presented. By taking the first-order Taylor's expansion of clock path delay 



http://portaLacm.org/resultsxfm?coll=ACM&dl=ACM&CFID=28108l49& 9/27/04 



Results (page 1): +programmable -fdelay +lines +flip +flop +hold +time +clock H-skew 



Page 3 of 6 



with respect to buffer widths, the original nonlinear problem is transformed to a sequence of 
linear programs, which incorporate clock skew scheduling and buffer sizing to minimize clock 
power dissipation. For each linear program, the sensi ... 

Keywords: clock skew scheduling, sequential linear programming, sizing 
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Razor: A Low-Power Pipeli ne B a sed on Circuit-Le v el Ti ming S peculation 

Dan Ernst, Nam Sung Kim, Shidhartha Das, Sanjay Pant, Rajeev Rao, Toan Pham, Conrad 

Ziesler, David Blaauw, Todd Austin, Krisztian Flautner, Trevor Mudge 

December 2003 Proceedings of the 36th Annual IEEE/ACM International Symposium on 
Microarchitecture 

Full text available: SI Bdf(568. 17 KB) 
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With increasing clock frequencies and silicon integration, power aware computing has 
become a critical concernin the design of embedded processors and systems-on-chip. One of 
the more effective and widely used methods for power-awarecomputing is dynamic voltage 
scaling (DVS). In orderto obtain the maximum power savings from DVS, it is essentialto 
scale the supply voltage as low as possible while ensuringcorrect operation of the processor. 
The critical voltage ischosen such that under a worst-case ... 

UST/DME: a clock tree ro ut er for general s k e w con s tra ints 
Chung-wen Albert Tsao, Cheng-kok Koh 

July 2002 ACM Transactions on Design Automation of Electronic Systems (TODAES), 

Volume 7 Issue 3 

Full text available* HI df(260 87 KB) Additional Information: f ull c ita t i on , abstrac t, r eferences , ci tin g s , index 
" ^ ' terms 

In this article, we propose new approaches for solving the useful-skew tree (UST) routing 
problem [Xi and Dai 1997]: clock routing subject to general skew constraints. The clock 
layout synthesis engine of our UST algorithnns is based on the deferred-merge embedding 
(DME) paradigm for the zero-skew tree (ZST) [Edahiro 1992; Chao et al. 1992] and 
bounded-skew tree (BST) [Cong and Koh 1995; Huang et al. 1995; Kahng and Tsao 1997; 
Cong et al. 1998] routings; hence, the names UST/DME and Greedy-UST/DM ... 

Keywords: Useful Skew, clock tree, feasible skew range, incremental skew scheduling, 
merging and embedding, merging region 
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Field programmable gate arrays 
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terms 

Wave-steering is a new design methodology that realizes high throughput circuits by 
embedding layout friendly synthesized structures in silicon. In the wave-steering design 
methodology, cir6cuits inherently utilize latches. Inside the synthesized structures they are 
used for signal skewing, and on the interconnects to guaraantee the correct arrival tinnes at 
the inputs. Recently, we proposed a novel high-throughput FPGA architecture based on the 
wave-steering design principle to handle throu ... 
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electronics and design 
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We have recently designed, fabricated, and successfully tested an experimental chip that 
validates a novel nnethod for reducing clock dissipation through energy recovery. Our 
approach includes a single-phase sinusoidal clock signal, an L-C resonant sinusoidal clock 
generator, and an energy recovering flip-flop. Our chip comprises a dual-mode ASIC with 
two independent clock systems, one conventional and one energy recovering, and was 
fabricated in a micron bulk CMOS process. The ASIC computes a pi ... 

Keywords: CMOS, VLSI, adiabatic logic, clock generator, flip-flop, low energy, resonant LC 
tank, single phase 
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With increased logic density due to the shift towards Deep Submicron technologies (DSM), 
FPGAs have become a viable option for implementing large designs. However, most 
commercial FPGAs, due to their general purpose architectural nature, cannot handle designs 
which require very high throughput. In this paper, we propose a novel high throughput FPGA 
architecture which tries to combine the high-performance of Application Specific Integrated 
Circuits (ASICs) and the flexibility afforded by the ... 
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Due to its potential to greatly accelerate a wide variety of applications, reconfigurable 
computing has become a subject of a great deal of research. Its key feature is the ability to 
perform computations in hardware to increase performance, while retaining much of the 
flexibility of a software solution. In this survey, we explore the hardware aspects of 
reconfigurable computing machines, from single chip architectures to multi-chip systems, 
including internal structures and external coupling. W ... 
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Fast, efficient logic simulators are an essential tool in modern VLSI system design. Logic 
simulation is used extensively for design verification prior to fabrication, .and as VLSI 
systems grow in size, the execution time required by simulation is becoming more and more 
significant. Faster logic simulators will have an appreciable economic impact, speeding time 
to market while ensuring more thorough system design testing. One approach to this 
problem is to utilize parallel processing, taking ... 
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We introduce the associative skew clock routing problem, which seeks a clock routing tree 
such that zero skew is preserved only within identified groups of sinks. The associative skew 
problem is easier to address within current EDA frameworks than useful-skew (skew- 
scheduling) approaches, and defines an interesting tradeoff between the traditional zero- 
skew clock routing problem (one sink group) and the Steiner minimum tree problem (n 
sink ... 
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